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Abstract 

Proper differentiation of naVve T helper cells into functionally distinct subsets is of critical importance to human 
health. Consequently, the process is tightly controlled by a complex intracellular signalling network. To dissect the 
regulatory principles of this network, immunologists have early on embraced system-wide transcriptomics tools, 
leading to identification of large panels of potential regulatory factors. In contrast, the use of proteomics approaches 
inT helper cell research has been notably rare, and to this date relatively few high-throughput datasets have been 
reported. Here, we discuss the importance of such research and envision the possibilities afforded by mass 
spectrometry-based proteomics in the near future. 
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INTRODUCTION 

CD4+ T helper (Th) cells form a vital regulatory 
component of the adaptive immune system. Their 
functional diversity is reflected in the number of 
alternative CD4+ T cell subsets arising from a 
common naive T cell progenitor. Currently 
known subset repertoire includes Thl, Th2, Thl7 
and iTreg cells, along with several recently dis- 
covered and less thoroughly characterized subsets, 
such as Tfih, Th9 and Th22 cells. The differentiation 
and functional properties of Th subsets have been 
subjects of several recent comprehensive reviews 
[1—4]. Th cell differentiation is initiated by engage- 
ment of the T cell receptor and binding of costimu- 
latory ligands and regulated primarily by the local 
cytokine micro environment. Cytokine signals are 
mediated by Signal Transducer and Activator of 
Transcription family of proteins, which together 



with T cell receptor signaling activate subset-charac- 
teristic transcription factors, most importantly T-bet, 
Gata3, RORc and Foxp3, promoting the develop- 
ment of Thl, Th2, Thl7 and iTreg cells, respectively 
(Figure 1). 

Importantly, deviant Th cell activity is implicated 
in numerous autoimmune and allergic diseases, 
prompting research into the molecular underpin- 
nings of Th effector subset development [5]. This 
research has relied heavily on the use of transcrip- 
tomics methods, namely DNA microarrays and more 
recently next-generation sequencing (NGS). In con- 
trast, studies of Th differentiation using proteomics 
approaches have been relatively scarce. System- wide 
protein analysis has remained technically challenging 
due to the biochemically complex and dynamic 
nature of proteomes. Unlike for nucleic acids, 
there are no enzymatic amplification methods for 
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Figure I: A simplistic model of the subset differentiation of T helper cells. The differentiation is initiated by contact 
with cognate antigen peptide-MHC complex on the antigen presenting cell. The direction of differentiation is largely 
determined by cytokines, the effects of which are relayed by specific STAT proteins. The STATs, together with 
downstream lineage-specifying transcription factors, orchestrate subset-specific gene expression programs, includ- 
ing production of hallmark cytokines. 



peptides or proteins, leading to a need for relatively 
large sample amounts. Moreover, proteins are pro- 
duced over a wide range of abundances (from dozens 
to millions of copies per cell), and are subject to 
numerous variations including alternative isoforms, 
proteolytic cleaveage and post-translational modifi- 
cations (PTMs). Consequently, proteomics studies 
have required highly specialized instrumentation, 
data analysis pipelines and technical expertise. Avail- 
ability of such resources has further limited the adop- 
tion of proteomics technology by immunologists. 
Here, we review the findings from recent studies 
where proteomics approaches have been applied to 
elucidate the processes involved in Th subset differ- 
entiation. In addition, we discuss the current limita- 
tions of such approaches and aim at foreseeing 
imminent development in this field. For a more 
thorough introduction to technical aspects of prote- 
omics, the reader is referred to recent excellent re- 
views [6—9]. 

TRANSCRIPTIONAL 
REGULATORY MECHANISMS 
LEADING TO SUBSET-SPECIFIC 
GENE EXPRESSION 

Early adoption of the Affymetrix technology by two 
groups rapidly led to identification of new players 



involved in Thl and Th2 differentiation [10—14]. 
Follow-up of these profiling studies have revealed, 
for example, the functional roles of ATF3, SATB1, 
PRELI and PIM kinases in human Th cell differen- 
tiation [15—18]. More recently, detailed kinetics of 
gene expression at early stages of mouse and human 
Thl 7 differentiation was studied by genome- wide 
micro arrays [19, 20]. In addition, genome- wide stu- 
dies on mice deficient for STAT4 and STAT6 clearly 
demonstrated their essential role in directing Thl 
and Th2 differentiation [12, 13, 21, 22]. Altogether, 
these studies lead to identification of hundreds of 
genes expressed differentially in response to cyto- 
kine-STAT signalling [23]. To further distinguish 
which genes are direct targets of STATs, chromatin 
immunoprecipitation (ChlP)-Seq has been used to 
map the binding sites of STAT4, STAT6, STAT3 
and STAT5 in response to IL-12, IL-4, IL-6 and/or 
IL-21, IL-2 stimulation using both wild type and 
STAT-deficient T cells [21, 22, 24-28]. Although 
it is well established that the STATs are important 
for cytokine signalling and initiation of differenti- 
ation processes, these studies established that even 
in differentiated Thl, Th2, Thl7 or iTreg cells, nu- 
merous STAT binding sites were mapped to the sig- 
nature cytokine loci. ChIP has also been applied to 
identify targets of the hallmark transcription factors 
T-bet and GATA3 in Thl and Th2 cells. 
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Interestingly, a set of genes including IL4 and IFNQ 
is bound by GAT A3 in both Thl and Th2 cells and 
also targeted by T-bet in Thl cells, supporting their 
opposite regulatory roles in Thl/Th2 fate decision 
[29]. 

However, identification of binding sites of tran- 
scription factors does not alone indicate functional 
impact of such binding. To address this question, 
further maps of transcription factor binding have 
been combined with detection of chromatin modi- 
fication markers such as histone H3 lysine 4 (H3K4) 
and lysine 27 (H3K27) trimethylation, enriched in 
promoters or repressors, respectively, as well as ex- 
pression levels of neighbouring genes [21, 30]. For 
example, H3K4me3 and STAT4, STAT6 or STAT3 
are found to be associated with Ifng, 114 or 1117, re- 
spectively, in differentiated Thl, Th2 and Thl 7 lin- 
eages [21, 22, 24]. Interestingly, STAT proteins 
appear to induce gene expression in one lineage 
and inhibition of associated active epigenetic marks 
in another lineage. In the case of GAT A3 and T-bet, 
these factors seem to regulate both expression of Th2 
or Thl lineage-specific genes as well as their active 
and repressive histone marks [30]. 

Furthermore, active enhancer landscapes have 
been mapped in Thl and Th2 cells [31, 32]. 
Consensus motifs for STAT1 and STAT4 are en- 
riched in Thl enhancers, whereas STAT6 motifs 
are found in Th2 but not Thl enhancers. Binding 
of these STATs can activate lineage-specific enhan- 
cers and suppress enhancers associated with alterna- 
tive cell fates [31]. 

To get an insight into human diseases it is neces- 
sary to carry out studies in primary human cells in 
addition to mouse studies. Genome-wide association 
studies (GWAS) have identified a large number of 
single-nucleotide polymorphisms (SNPs) that are 
associated with autoimmune diseases. The majority 
of these SNPs are, however, in the non-coding re- 
gions of DNA. To understand how these SNPs 
might contribute to disease pathogenesis, it is crucial 
to find out what is the function of such regions of 
non-coding DNA. A recent integrative study on 
human cells that included analysis of chromatin 
state maps, transcription factor-binding sites and 
modelling of enhancer-gene pairs combined with 
GWAS studies of autoimmunity-associated SNPs 
suggest a direct role for distal regulatory elements 
in disease aetiology. Such SNPs were shown to 
alter binding of transcription factors involved in Th 
cell differentiation [32]. Extension of such studies 



that map the landscape of regulatory SNPs in 
human T cell biology will be vital in order to 
better dissect and understand the molecular mechan- 
isms of immune-mediated diseases in man. 



FROM TRANSCRIPTOMES TO 
SUBSET-SPECIFIC PROTEOMES 

Transcriptional control mechanisms only account for 
part of the regulation of gene expression. In fact, 
only ^40% of variation in protein quantities can 
be explained by variation in mRNA quantities 
[33]. Thus, for a conclusive protein-level view, 
direct proteomics analysis is required. Differentially 
expressed proteins can be identified by direct mass 
spectrometric analysis after sufficient fractionation, or 
based on visual comparison of two-dimensional elec- 
trophoresis (2DE) patterns. Quantification can be 
performed using isobaric labelling or so-called 
label-free approaches a relying on careful alignment 
of extracted ion chromatograms [7] . 

However, while current state-of-the art transcrip- 
tomics afford complete system-wide detection of 
analytes, proteomic techniques have only recently 
begun to approach comparable level of coverage. 
Thus, the detection of proteins is typically limited 
and biased towards the higher abundance molecules. 
Accordingly, relatively few proteins have been iden- 
tified as differentially regulated in response to 
Th-polarizing cytokines. Furthermore, such studies 
have often failed to recapitulate known expression 
patterns of subset-specific hallmark molecules, such 
as transcription factors and cytokine receptors, often 
present at low copy numbers (100—1000 copies per 
cell) [34]. 

The global effects of IL12 signalling have been 
investigated in both early (24 h) and late stages 
(7 and 14 days) of human Thl differentiation 
(Figure 2) [35, 36]. The lists of IL12-regulated pro- 
teins in these different time points showed little 
overlap, likely resulting from incomplete coverage 
achieved in each study as well as the temporally dy- 
namic nature of Th subset differentiation. For ex- 
ample, Cyclophilin A was detected as up regulated 
in Thl -promoting conditions at Day 1 but no 
longer at Days 7 or 14. Nevertheless, both studies 
were successful in detecting known Thl -associated 
proteins. At 24 h, programmed cell death 4 
(PDCD4), a known IL12 target, was measured as 
2.5-fold induced. At the later time points, the 
IL12-induced proteins were enriched for known 
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Total proteome 
-Th1 (35,36) 
-Th2 (36,38,39) 



Global phospho- 
proteome 
-ThO (65,66,67) 




TCR raft proteome 
-ThO (58) 



Nuclear 
proteome 
-Th2 (51) 




Transcription factor interactomes 
-T-bet (78, 79) 
-GATA3 (76, 78) 
-FOXP3 (77) 



Membrane 
proteome 
-Th1(42, 46) 
-Th2 (42, 48) 



Chromatin and 
nuclear lamina 
associated 
J proteome 
■ThO (57) 



Figure 2: Proteomics approaches to elucidate processes leading to the development of T helper effector subsets. 
ThO, Thl and Th2 refer to experimental conditions where cells were activated (ThO) and cultured in presence or 
absence of cytokines promoting differentiation of Thl orTh2 cells. 



targets of interferons, including interferon-induced 
protein 35 (IFP35), ISG15 ubiquitin-like modifier 
(UCRP) and Tryptophanol-tRNA synthetase. In 
fact, several of these proteins had been earlier iden- 
tified as direct targets of interferon-Qf in a similar 
experimental model [37]. These findings illustrate 
the importance of positive feedback delivered by 
autocrine and paracrine signalling during Th 
differentiation. 

Proteomic responses to the Th2 -promoting IL4 
signalling have been studied both in human [36, 38] 
and by using Stat6-/- mice [39]. As with IL12, the 
resulting data were of limited coverage and contained 
many structural or otherwise highly abundant pro- 
teins. In the context of Th2 differentiation, the most 
interesting observations were upregulation of core 
binding factor subunit beta, isoform 2 (CBFb2) and 
cellular nucleic acid binding protein (CNBP). Both 
proteins are implicated in regulation of gene expres- 
sion. CNBP can regulate gene expression at both tran- 
scriptional and translational levels. CBFb2, in turn, is a 
co-factor for Runx family transcription factors. 
Notably, RUNX1 is an inhibitor of GAT A3 expres- 
sion and a direct target of STAT6 [28, 40]. 
Furthermore, the RUNX1— CBFb complex has been 
shown to be necessary for Treg development [41]. 

The above mentioned results were obtained by 
the use of 2DE, which is limited in its ability to 
resolve poorly soluble hydrophobic membrane 



proteins. In a complementary study by Loyet et al. 
[42], plasma membrane preparations from human 
Thl and Th2 cells were compared (at 7 days post- 
activation). The authors identified bone marrow 
stromal protein 2 (BST2) and T cell receptor inter- 
acting molecule (TRIM), among the proteins most 
upregulated in Thl conditions. BST2, also known as 
tetherin, plays a role in lateral organization of plasma 
membrane microdomains [43]. TRIM is associated 
with some, but not all T cell receptor complexes and 
plays a role in thymic selection [44, 45]. The exact 
role of these proteins in Th differentiation remains to 
be elucidated. By targeting membrane-derived 
microsomal fractions, Filen et al. [46] found two 
physically interacting proteins, galectin-1 and CD7, 
to be downregulated by IL12. Galectin-1 has since 
been reported to be produced by Th2 cells, promot- 
ing production of Th2 cytokines and antagonizing 
Thl survival [47]. Conversely, in response to IL4, 
GTPase of the immunity-associated protein 4 
(GIMAP4) was found to be downregulated [48]. 
Furthermore, transcriptomics analysis revealed that 
GIMAP4 is upregulated in Thl cells. Members of 
the GIMAP family are expressed in specific patters 
during T cell development, with expression of 
GIMAPS 3, 4, 5 and 7, being increased during the 
transition from the double positive to the single posi- 
tive stage. However, deletion of GIMAP4 does not 
impair T cell generation, but instead seems to be 
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involved in caspase-3-mediated apoptosis [49, 50]. 
Importantly, GIMAP4 seems to regulate cytokine 
production by Th cells (Heinonen etal, unpublished 
observation) . 

Besides absolute abundance, protein activity can 
be regulated by differential localization in subcellular 
compartments. Targeted studies of subcellular frac- 
tions can thus provide biologically meaningful infor- 
mation beyond the context of global gene 
expression. In particular, transcription factors are 
frequently inducibly recruited to or retained in the 
nucleus in response to specific signals. During Th2 
differentiation, phosphorylated STAT6 localizes 
preferentially to the nucleus, leading to higher rela- 
tive increase in concentration than suggested by the 
IL4-induced increase in the level of the STAT6 
mRNA. To identify other IL4-regulated proteins 
potentially involved in transcriptional or epigenetic 
regulation, nuclear fractions of human Th2 cells have 
been analysed using iTRAQ-based quantitative 
proteomics [51]. The resulting list of 30 proteins, 
included two IL4-induced transcription factors, Y 
box binding protein 1 (YBX1) and Ikaros (IKZF1), 
not belonging to the canonical Th2 or Thl path- 
ways, although the latter has been recently shown to 
directly inhibit expression of T-bet and activate 
expression of Th2 cytokines [52, 53]. 

In some of these studies, differential expression 
was also investigated at the RNA level using either 
microarray datasets or quantitative RT— PCR. 
Rautajoki etal. [36] observed a correlation between 
protein abundance changes and microarray data in 6 
out of 11 cases. Differential regulation of membrane- 
associated BST2 and TRIM was not observed at the 
mRNA level [42]. In the study of IL4-responsive 
nuclear proteins by Moulder et al. [51], significant 
transcriptional regulation was detected in three out 
of eight cases. Altogether, such comparisons are 
complicated by considerations of subcellular localiza- 
tion, extracellular secretion, temporal separation of 
transcription and translation and uncertainty asso- 
ciated with assignment of peptides to alternative iso- 
forms. In mammalian cells, protein abundance has 
been shown to be predominantly regulated at the 
level of translation. Protein copy numbers correlate 
strongly with the rate of translation, with proteins of 
highest abundance translated at least 100 times more 
efficiently than the least abundant proteins [34, 54]. 

Translation efficiency is in part regulated by eu- 
karyotic translation initiation factors (elF). 
Interestingly, eIF4E, regulator of mRNAs involved 



in processes including cell cycle, innate immunity 
and apoptosis, is expressed at a higher level in acti- 
vated Foxp3- than Foxp3+ T cells. Furthermore, 
inhibition of eIF4E activity lead to upregulation of 
Foxp3 [55]. It seems plausible that similar post- 
transcriptional control is also associated with other 
Th subset differentiation programs. 

CONTRIBUTION OF T CELL 
ACTIVATION TO PROTEIN 
EXPRESSION AND LOCALIZATION 
DURING SUBSET 
DIFFERENTIATION 

Significant changes of gene expression occur in 
response to T cell activation through CD3/T cell 
receptor (TCR) complex and costimulatory recep- 
tors, correlating with initiation of cell cycle progres- 
sion, growth and metabolic changes. As naive T cells 
are generally quiescent and arrested at GO, they lack a 
number of proteins required for entry into and com- 
pletion of cell cycle [56, 57]. 

In an extensive recent work by Orr et al. [57], 
changes associated with activation and cell cycle ini- 
tiation were investigated both on a systemic level as 
well as in terms of proteins specifically recruited to 
chromatin or to the nuclear lamina. Among the total 
of 2894 proteins identified, 1724 were associated 
with the chromatin fraction. Notably, the entry to 
Gl stage of cell cycle was accompanied with a sig- 
nificant shift in protein expression, with the abun- 
dance of 772 proteins increased and 630 decreased. 
Of the chromatin or nuclear matrix-bound proteins, 
307 were increased and 211 decreased. Many of the 
induced proteins were involved in ribosome biogen- 
esis, reflecting the acceleration of protein synthesis 
and growth during cell cycle entry. Importantly, by 
silencing expression of splicing factor 3b, subunit 2 
(SF3B2) and eIF6 the authors showed that cell cycle 
progression is not obligatorily connected to cellular 
growth. Furthermore, contrary to previous know- 
ledge, the data indicated that xin actin-binding 
repeat containing 1 (XIRP1) protein is expressed 
in human T cells and inducibly associated with the 
chromatin fraction. The role of this relatively poorly 
characterized protein in T cell activation is not 
known. In addition to chromatin, the local micro- 
environment of the T cell receptor is of particular 
interest in the context of T cell activation. The 
proteome of this fraction was characterized in a 
recent targeted study by de Wet et al. [58]. The 
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authors identified proteins associated with the forma- 
tion of the immunological synapse, endocytosis of 
the TCR as well as negative regulators of activation. 

Importantly, full activation of T cells requires 
engagement of the T cell receptor and concomitant 
signalling through the costimulatory receptors, in 
particular, CD28. The pathways activated by the 
two stimuli are partially overlapping, both leading 
to activation of NFkB and NFAT [59]. The relative 
contribution of CD28 signalling to proteomic 
changes following T cell activation has been studied 
using 2DE. Kronfeld et al [60] have identified 35 
proteins regulated in response to either of two 
CD28 ligands, B7-1 or B7-2. These proteins were, 
in general associated with activation- and prolifer- 
ation-related processes. More recently, Lichtenfels 
etal. [61] have identified a panel of 52 proteins regu- 
lated by CD3, CD28 and the downstream effector, 
cytokine IL2. Again, the functional annotations of 
these proteins were in agreement with biological 
processes related to T cell activation, with 18 pro- 
teins annotated with metabolic processes, 15 with 
transport, seven with cytoskeleton organization and 
six with signal transduction. 

Altogether, T cell activation leads to considerable 
reorganization of the proteome, which to a great 
extent reflects the alterations of cellular morphology 
and metabolism. Many of these of changes have been 
characterized in studies of relatively late points of T 
cell activation, i.e. hours or days. For insights into 
the early events of activation, the role of post- 
translational modifications has to be taken into 
consideration. 



EARLY SIGNAL TRANSDUCTION 
BY PTMs 

PTMs provide rapid and reversible mechanisms for 
regulating protein activity. On average, each protein 
has been estimated to be a target for 2.5 modifica- 
tions [62]. In particular, phosphorylation of tyrosine, 
serine and threonine residues are widely used as 
means of signal transduction. Proteomics has the 
unique capacity to identify new PTM sites and quan- 
tify PTMs in a system- wide fashion. 

The main challenges for studies of PTMs, includ- 
ing phosphorylation, arise from the typically low 
ratio of the modified protein or peptide and the un- 
modified form. Recently, several mass spectrometry- 
compatible system-wide strategies allowing selective 
isolation of phosphorylated proteins or peptides have 



been developed [63]. These include immobilized 
metal affinity chromatography (IMAC), titanium di- 
oxide resins and immunoprecipitation with antibo- 
dies against phosphorylated amino acid residues, in 
particular, phospho tyrosine (pY). With these tools, 
more than 100-fold enrichment can be achieved 
[7 and references 93—95 therein]. The methods can 
also be used in combination. For example, the flow 
through fraction from Ti0 2 affinity chromatography 
can be further precipitated by anti-pY antibodies. 

Using global phosphopeptide enrichment strate- 
gies, thousands of phosphosites can be detected. 
However, large amounts of sample material are con- 
sumed in such analyses. Therefore, most T cell- 
related phosphoproteomics studies have been per- 
formed using the Jurkat T cell line [64] . In activated 
Jurkat cells, more than 10 000 phosphorylation sites 
have been identified [65] . With more limited mater- 
ial from primary CD4+ T cells, 2800 sites have been 
identified [66]. The Majority of the identified sites 
are in serine residues, followed by threonine and 
tyrosine at an approximate ratio of 88, 10 and 2%, 
respectively [67]. Despite the relative scarcity of 
phosphotyrosine sites, they are regarded as particu- 
larly important, as they have well-documented roles 
in regulating protein activity through both conform- 
ational change and creating specific binding sites for 
Src homology 2 (SH2) domains found in a diverse set 
of proteins involved in signal transduction [68]. 
Importantly, both proximal T cell receptor signalling 
as well as cytokine-responsive JAK-STAT pathways 
rely on successive tyrosine phosphorylation events. 
Consequently, a number of studies have concen- 
trated exclusively on the role of tyrosine phosphor- 
ylation in T cell signalling [69]. 

In several studies, quantitative changes in the 
phosphoproteome in response to T cell activation 
were measured. These studies have generally focused 
on the early time points following TCR engage- 
ment, as it has been shown that cellular phosphotyr- 
osine levels reach their maximum level already at 
1— 2min after activation, and phosphoserine and 
phosphothreonine at ^20min [66]. In the studies 
with the highest coverage so far, 696 phosphoryl- 
ation sites on 453 distinct proteins were found 
responsive to T cell receptor stimulation [65]. 
These included many novel phosphorylation sites 
on proteins with known roles in T cell activation 
as well as new sites on proteins previously not impli- 
cated in T cell activation. Such findings, together 
with the limited overlap of phosphosites identified 
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between studies suggest that the full complexity of T 
cell phosphoproteome has not yet been reached by 
current investigations. 

T cell responsive phosphoproteins are found in all 
major cellular compartments, although they are par- 
ticularly enriched in the plasma membrane [66]. 
Similarly, TCR-regulated phosphorylation events 
pervade all cellular processes associated with T cell 
activation. The largest group of these sites is related 
to activation-induced morphological changes, such 
as formation of immunological synapse and pattern- 
ing of T cell surface proteins. Interestingly, proteins 
annotated with transcriptional regulatory functions 
seem in general to be dephosphorylated in response 
to activation [66]. 

Global phosphoproteome responses to key cyto- 
kines promoting differentiation of Th subsets have 
not been characterized. Recently, Osinalde et al 
applied phosphotyrosine immunoprecipitation on 
cells of the Kit225 T cell line, stimulated for 5 min 
with IL2 [70]. The authors reported 56 proteins 
reproducibly enriched in the immunoprecipitated 
fraction. For 33 of these proteins, specific phospho- 
tyrosine sites could be identified, other proteins 
being likely to co-precipitate due to protein-protein 
interactions with the phosphorylated proteins. 
Accordingly, the identified proteins were enriched 
for SH2 domains. Functionally, the identified pro- 
teins were enriched for categories including cellular 
growth and proliferation well in line with the estab- 
lished functions of IL2 and the IL2-dependent nature 
of the Kit225 cell line. 

In contrast with immortalized cell lines, studies of 
cytokine signalling using primary Th cells in general 
require additional stimulation of the TCR, as T cell 
activation is necessary for cytokine-induced subset 
differentiation as well as for expression of receptors 
for several cytokines including IL2. However, as T 
cell activation leads to widespread phosphorylation 
of abundant proteins, an important challenge will be 
detecting the events induced specifically by cytokine 
signalling, and determining the extent of cross-talk 
between the T cell activation and cytokine-induced 
pathways. 

An important trend emerging from the findings of 
such system-wide studies is the close relationship of 
protein phosphorylation and protein— protein inter- 
actions. While binding of phosphorylated tyrosine 
residues with SH2 domains has been recognized for 
long, similar mechanisms seem to frequently play 
roles also in serine and threonine phosphorylation. 



More specifically, it has been shown that threonine 
and serine residues phosphorylated in response to 
TCR activation are enriched in protein domains 
that are part of protein— protein interaction surfaces 
[65]. Activation-induced phosphorylation can influ- 
ence formation of interaction either negatively or 
positively. For example, expression of Fos and 112 
is promoted by serine phosphorylation-induced dis- 
ruption of interactions between co-repressors nuclear 
receptor co-repressor 2 (NCOR2), Brahma protein 
homolog (BRG1), and heterochro matin protein 1 
homolog (HPlof) and their respective transcription 
factors. In contrast, patterning of T cell surface mol- 
ecules associated with T cell activation is dependent 
on phosphorylation-induced interactions [65]. 

Finally, it is becoming increasingly evident that 
phosphorylation is not the only form of PTMs 
involved in T cell signalling. Activity of STAT pro- 
teins is, in addition to phosphorylation, controlled by 
lysine acetylation [71]. Arginine methylation has 
been reported to be involved in processes controlling 
expression of Th2 cytokine locus [72]. While anti- 
body-based methods for global enrichment of both 
acetylated and methylated peptides exist, up to this 
date, they have not been applied in T cell research 
[73]. Importantly, the challenges associated with 
PTM mapping are multiplied when combinatorial 
effects of modifications are taken into account [74]. 

REGULATION OF 
LINEAGE-SPECIFYING 
TRANSCRIPTION FACTORS BY 
PROTEIN-PROTEIN 
INTERACTIONS 

Functional activity of proteins involves transient pro- 
tein-protein interactions and formation of stable 
protein complexes. With affinity purification and 
mass spectrometry, binding partners of a protein 
can be interrogated in an untargeted fashion, offering 
important insight to protein function [75]. 
Currently, the ultimate goal of interactome map- 
ping, cataloguing the full network of protein— protein 
interactions, remains to be completed in mammalian 
cells. Furthermore, protein— protein interactions are 
subject to context-specific regulation. A pair of pro- 
teins can therefore, in different conditions, exhibit 
varying tendency to interact due to changes in regu- 
latory signals or stoichiometric ratios of binding part- 
ners. Therefore, it is crucial to study protein-protein 
interactions in specific biological systems. 
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In the context of Th cell differentiation, of key 
importance are the interactions involving the lineage- 
specifying transcription factors, as their DNA binding 
capacity and regulatory potential can be modified by a 
variety of co-factors. For example, GATA3 has been 
reported to form two distinct complexes, GATA3/ 
CHD4/p300 and GATA3/CHD4/NURD [76]. 
The former is found at Th2 cytokine loci and induces 
transcription, whereas the latter is repressive and found 
at Tbx21 locus. Furthermore, the lineage-specifying 
Treg transcription factor FOXP3 has been identified 
as part of large (400— 2000 kDa) protein complexes, 
with more than 300 potential binding partners [77]. 
Notably, FOXP3 directly regulates the expression of 
many of these co-factors. The opposing regulatory 
roles of some co-factors suggest that FOXP3 can 
form functionally distinct complexes in response to 
specific cues. This finding provides a potential mech- 
anistic explanation for the functional heterogeneity 
associated with Treg cells. 

Interestingly, the lineage-specifying transcription 
factors can also in many cases interact with each 
other. GATA3 interacts with tyrosine-phosphory- 
lated T-bet, negatively influencing its binding to 
DNA and thus antagonizing Thl differentiation 
[78]. T-bet, in turn, is able to interact with the Tfh- 
promoting BCL6 and coordinately regulate gene ex- 
pression [79]. With FOXP3, remarkably many of the 
identified interacting proteins are transcriptional 
regulators implicated in Th cell differentiation, 
including GAT A3, RUNX1, NFAT, STAT3 and 
IKZF1 [77]. Altogether, it is turning out that the lin- 
eage-specifying transcription factors form a densely 
connected regulatory network. The physical inter- 
actions between the nodes of this network are likely 
to provide an important component in regulation of 
Th cell subset differentiation and plasticity [80] . More 
detailed elucidation of kinetics of such interactions in 
response to TCR and cytokine stimulation could pro- 
vide further insight into these processes. 



FUTURE OUTLOOK 

Proteomics is a field in rapid development and in 
many ways its full potential remains to be exploited. 
In particular, studies with limited sample material, 
e.g. from primary cell cultures, have suffered from 
the restricted sensitivity and dynamic range of mass 
spectrometry-based approaches. 

In the context of Th cell differentiation, many of 
the key proteins, such as transcription factors, kinases 



and regulators of calcium signalling or cell adhesion 
are found at low copy numbers (<5000 copies per 
cell) [81]. Proteome level investigation of many 
biologically crucial processes would thus require 
reproducible detection and quantification of such 
low abundance proteins. In addition, improvements 
in sensitivity would enable analyses with smaller 
amounts of sample material. For example, in 
the case of primary cells, the sample could potentially 
be analysed as biologically meaningful subpopulations 
defined by expression of surface markers. Notably, 
heterogeneity within Th cell responses is becoming 
increasingly appreciated, and studying the Th cell 
subpopulation structures with a greater resolution 
could in many cases lead to greater biological insight. 

The ultimate goal for analytical sensitivity is the 
ability to study proteomes derived from single cells. 
Such analyses would take into account the ubiqui- 
tous cellular heterogeneity and describe nature of 
responses to external stimuli more accurately than 
studies relying solely on population-level averages. 
Although the important milestone of single-cell 
resolution has recently been reached by transcrip- 
tomics, the challenges involved in a similar prote- 
omics strategy are substantial. All current single-cell 
transcriptomics approaches require significant ampli- 
fication of starting material — something not possible 
at the protein level. Although each mRNA molecule 
on the average gives rise to 2800 proteins [54], the 
single cell proteomics analysis would still be per- 
formed with a very limited number of molecules. 
Furthermore, cell lysis, digestion and sample frac- 
tionation would have to be extremely efficient and 
reproducible. To date, single-cell mass spectrometry 
experiments have only been reported with highly 
abundant peptides, lipids or metabolites [82]. Argu- 
ably, the most powerful tools for high-throughput 
single-cell protein-level investigation are currently 
offered by multi-parameter flow cytometry and the 
recently developed mass cytometry [83]. However, 
these approaches are based on the use of antibodies 
with all the associated theoretical and practical 
limitations. 

Importantly, detection sensitivity is closely related 
to the concept of dynamic range. The concentrations 
of cellular proteins vary over seven orders of magni- 
tude [54, 81, 84]. Thus, a pool of tryptic peptides 
from a complete cellular proteome will be domi- 
nated by peptides from the highly abundant proteins. 
A mass spectrometer operating in a typical data-de- 
pendent acquisition mode is thus prone to repeatedly 
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sampling ions from relatively few common proteins, 
and missing the more rare ions entering the instru- 
ment simultaneously. The problem has been partially 
alleviated by the faster duty cycles, i.e. sampling 
times used in modern instruments. In addition, 
sample can be subjected to extensive chromato- 
graphic fractionation prior to analysis, which how- 
ever can also lead to sample loss [85]. Alternatively, 
with prior information of the sample, assays can be 
designed, which target and quantify specific peptides 
of interest. With these techniques, generally called 
selective reaction monitoring (SRM) or multiple re- 
action monitoring (MRM), individual peptides are 
selected and fragmented, typically using a triple 
quadrupole instrument and resulting fragment ions 
of expected mass to charge ratio (referred to as tran- 
sitions) are quantified [86]. In yeast, the dynamic 
range of a single SRM-based analysis has been 
demonstrated to span six orders of magnitude, from 
1.3 million to <50 copies per cell [87]. However, 
whereas high sensitivity and dynamic range can be 
achieved, designing and validating the assays do re- 
quire considerable efforts. Recently, a conceptually 
related global analysis strategy, SWATH-MS, was 
introduced. With this data-independent acquisi- 
tion-based strategy, dynamic range of four orders 
of magnitude was reached [88]. 

Besides improvements in sensitivity, studies of Th 
cell biology would benefit from straightforward 
methods for absolute protein quantification. 
Isobaric peptide labels are typically used to determine 
relative quantities of a peptide or protein between 
two samples with no information gained on abun- 
dance differences between protein species. 
According to recent reports, Th cell lineage specifi- 
cation is likely to result from quantitative effects of 
opposing transcription factors [80, 89, 90]. 
Moreover, concentration of a transcription factor is 
typically correlated with DNA occupancy. Thus, de- 
termination of protein copy numbers per cell would 
enhance explanatory power of studies targeting regu- 
latory networks leading to Th differentiation. 
Although strategies for absolute quantification using 
either spike in peptides [91] or bioinformatics analysis 
[92] have been developed, neither are yet routinely 
used. Notably, in a recent investigation Simicevic 
et al [93] used SRM-based workflow to determine 
absolute quantities of 10 key transcription factors 
involved in adipocyte differentiation, revealing that 
the copy numbers vary considerably, from ^250 to 
over 300 000 molecules per cell [93]. Similar 



approaches would likely prove to be informative 
also in the case of T cells. 

In conclusion, several features of Th cell differen- 
tiation remain to be elucidated at the protein level. 
These include the regulatory network of lineage- 
specifying transcription factors, in particular, 
determination of absolute copy numbers, post- 
translational modifications, mutual interactions and 
temporal regulation in subset polarizing conditions. 
The rapid technical development of mass spectrom- 
etry together with the increasing rate of adoption of 
proteomics methods among immunologists offer 
promise that many of these questions can soon be 
addressed. 
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